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3 <110> APPLICANT: Gellissen, Gerd 

4 Braus, Gerhard 

5 Pries, Ralph 

6 Krappmann, Sven 

7 Strasser, Alexander 

9 <120> TITLE OF INVENTION: Nucleic Acid Molecule Comprising a Nucleic Acid Coding 

Polypeptide 

10 with Chorismate Mutase Activity 

12 <130> FILE REFERENCE: 029474-5007-00 

14 <140> CURRENT APPLICATION NUMBER: 10/042059B 

15 <141> CURRENT FILING DATE: 2001-10-25 

17 <150> PRIOR APPLICATION NUMBER: DE 199 19 124.7 

18 <151> PRIOR FILING DATE: 1999-04-27 
20 <160> NUMBER OF SEQ ID NOS : 7 

22 <170> SOFTWARE: Patentln version 3.1 
24 <210> SEQ ID NO: 1 
2 5 <211> LENGTH: 843 

26 <212> TYPE: DNA 

27 <213> ORGANISM: Hansenula polymorpha 

2 9 <220> FEATURE: 

30 <221> NAME/KEY: CDS 

31 <222> LOCATION: (1)..(843) 

32 <223> OTHER INFORMATION: 

3 4 <400> SEQUENCE: 1 
3 5 atg gac ttt atg aag 
3 6 Met Asp Phe Met Lys 
37 1 5 

3 9 gat gcc ttg gtc egg 

40 Asp Ala Leu Val Arg 

41 20 
43 egg teg cag ttc tat 

4 4 Arg Ser Gin Phe Tyr 
45 35 
4 7 cct att ccc aac ttc 
4 8 Pro lie Pro Asn Phe 
49 50 

51 cac gag cga ate cat 

52 His Glu Arg lie His 

53 65 

55 gtg cct ttt ttc ccc 

56 Val Pro Phe Phe Pro 

57 85 

59 aac tac cca teg gtg 

60 Asn Tyr Pro Ser Val 



^u^r^^^u^xr^inAnix^u faf™ 91121111 



cca gaa aca gtg 
Pro Glu Thr Val 

atg gag gat acg 
Met Glu Asp Thr 
25 

gcg teg ccc teg 
Ala Ser Pro Ser 
40 

gac ggc teg ttc 
Asp Gly Ser Phe 
55 

teg cag gtg agg 
Ser Gin Val Arg 
70 

aac gtg ctg gaa 
Asn Val Leu Glu 

eta gcc tec tac 
Leu Ala Ser Tyr 



ctg gac ctt ggc aac 
Leu Asp Leu Gly Asn 
10 

ate ate ttc aac ttt 
lie lie Phe Asn Phe 
30 

gta tac aaa gtc aac 
Val Tyr Lys Val Asn 
45 

ttg gac tgg ctg ttg 
Leu Asp Trp Leu Leu 
60 

aga tac gac gcg cca 
Arg Tyr Asp Ala Pro 
75 

aaa acg ttt ctg ccc 
Lys Thr Phe Leu Pro 
90 

gcg gat gaa ate aac 
Ala Asp Glu He Asn 



att aga 48 

He Arg 

15 

ate gag 96 
He Glu 

cag ttc 144 
Gin Phe 

teg cag 192 
Ser Gin 

gac gag 240 
Asp Glu 
80 

aag ate 288 

Lys He 

95 

gtc aac 336 
Val Asn 



I 
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100 Asp Glu Glu Asp Asp Asp Ala Thr Gin Lys Ser Gly Gly Tyr Val Asp 

101 260 265 270 

103 egg ttt etc tec tct ggc ttg tac tag 843 

104 Arg Phe Leu Ser Ser Gly Leu Tyr 

105 275 280 

108 <210> SEQ ID NO: 2 

109 <211> LENGTH: 280 

110 <212> TYPE: PRT 

111 <213> ORGANISM: Hansenula polymorpha 
113 <400> SEQUENCE: 2. 

115 Met Asp Phe Met Lys Pro Glu Thr Val Leu Asp Leu Gly Asn He Arg 

116 15 10 15 

119 Asp Ala Leu Val Arg Met Glu Asp Thr He He Phe Asn Phe He Glu 

120 20 25 30 

123 Arg Ser Gin Phe Tyr Ala Ser Pro Ser Val Tyr Lys Val Asn Gin Phe 

124 35 40 45 

127 Pro He Pro Asn Phe Asp Gly Ser Phe Leu Asp Trp Leu Leu Ser Gin 

128 50 55 60 

131 His Glu Arg He His Ser Gin Val Arg Arg Tyr Asp Ala Pro Asp Glu 

132 65 70 75 80 
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187 <210> SEQ ID NO: 3 

188 <211> LENGTH: 1655 

189 <212> TYPE: DNA 

190 <213> ORGANISM: Hansenula polymorpha 
192 <220> FEATURE: 

19 3 <221> NAME/KEY: gene 

194 <222> LOCATION: (1)..(1655) 

195 <223> OTHER INFORMATION: 1,8 kb genomic DNA 

198 <220> FEATURE: 

199 <221> NAME/KEY: gene 

200 <222> LOCATION: (1)..(1655) 

201 <22 3> OTHER INFORMATION: 1,8 kb genomic DNA 
204 <400> SEQUENCE: 3 

20 5 cccggcccaa tgccagcaat atggagacgt ttaggcagaa 
207 cgctgcttgt tgccaccgga atatacaccg cattgeagtt 
209 acgattacat tggcggaacg tatcgegagt cgctcacgag 
211 aatcgcgaaa cgaccttata gaegcaegtg aaaactaegg 
213 agegaatcca gcggtttttg tggttcagac atetttegtg 
215 gaacttgagg agcgtttttt ttttcctgtt tagtttttgt 
217 cagaaacagt gctggacctt ggcaacatta gagatgeett 
219 tcatcttcaa ctttatcgag eggtegcagt tetatgegtc 
221 accagttccc tattcccaac ttcgaegget cgttcttgga 
223 agegaatcca ttcgcaggtg aggagatacg acgcgccaga 



-fragment from Hanseula polymorpha 



-fragment from Hansenula polymorpha 

taggegttec atacttctca 60 

tgcacacatc atactatatg 120 

aegcattaga atgacagaga 180 

gtttggaggc agcaaggagg 240 

gcttttaggc gaggataagc 300 

aggtatggac tttatgaagc 360 

ggtccggatg gaggatacga 420 

gccctcggta tacaaagtca 480 

ctggctgttg tcgcagcacg 54 0 

egaggtgect tttttcccca 600 
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225 acgtgctgga aaaaacgttt ctgcccaaga tcaactaccc atcggtgcta gcctcctacg 660 

227 cggatgaaat caacgtcaac aaagagatac tcaagatcta cacgtcagag atagtaccag 720 

229 gaatagctgc aggcagcgga gagcaggagg acaaccttgg ctcgtgcgca atggccgaca 780 

2 31 tcgagtgcct gcagtcgcta tccagaagaa tccattttgg ccgttttgtc gcagaggcta 84 0 

233 aatttatcag tgagggggac aagattgtgg atctgatcaa aaagagagat gtggaaggca 900 

235 ttgaggcgct catcacaaac gccgaggtcg aaaaacggat cttggacaga cttctggaga 960 

237 agggaagggc gtatggaaca gacccgacac taaagttcac gcagcacatt cagagcaagg 1020 

239 tgaagcccga ggtgattgtg aaaatctaca aggatttcgt gattccgctc acgaagaagg 1080 

241 tcgaagtcga ctacttgctg agacggctgg aggacgagga ggacgatgat gcgacgcaga 114 0 

243 aaagcggcgg ctacgttgac cggtttctct cctctggctt gtactagaaa ttaaaatttt 1200 

245 cagtacttta attattctcg aattctagtt cagataccgc atggtaattt caaaggccag 1260 

247 aaaagtggcc gcgttggctg gggcagctct cagaatagtc ggcgagaatc ctttgactag 1320 

249 cccccaggca ccgctctgtc tccaaatacc cctaatagtc tcaacagcat ttctataaac 1380 

2 51 cagcttcttg tagttgtccg tctgcatgtt ggacttgatc acatcgatcg gataaatact 144 0 

253 gaaccacatc ccgtaacctg ccagcgcccc aaagacgcag agcttccagt tctcgatgtc 1500 

255 cttcctggca atattccgcg actcgatctc gtttttcacg agagcttcaa aagtcagaaa 1560 

257 atacgctccg ctacccaaac tttctcttgc cagcgtaggt cccagacccc ggtagattaa 1620 

259 cttgatgcct cccgtatggt acagcttctt gatcc 1655 

262 <210> SEQ ID NO : 4 

263 <211> LENGTH: 20 

264 <212> TYPE: DNA 

C--> 265 <213> ORGANISM: Artificial 

267 <220> FEATURE: 

268 <223> OTHER INFORMATION: Oligonucleotide 

270 <400> SEQUENCE: 4 

271 aattaaccct cactaaaggg 20 

274 <210> SEQ ID NO: 5 

275 <211> LENGTH: 22 

276 <212> TYPE: DNA 

C--> 277 <213> ORGANISM: Artificial 

279 <220> FEATURE: 

280 <223> OTHER INFORMATION: Oligonucleotide 

282 <400> SEQUENCE: 5 

283 gtaatacgac tcactatagg gc 22 

286 <210> SEQ ID NO: 6 

287 <211> LENGTH: 26 

288 <212> TYPE: DNA 

C--> 289 <213> ORGANISM: Artificial 

291 <220> FEATURE: 

292 <223> OTHER INFORMATION: Oligonucleotide 

294 <400> SEQUENCE: 6 

295 atatagatct acaaaaacta aacagg 26 

298 <210> SEQ ID NO : 7 

299 <211> LENGTH: 28 

300 <212> TYPE: DNA 

C--> 301 <213> ORGANISM: Artificial 

303 <220> FEATURE: 

304 <223> OTHER INFORMATION: Oligonucleotide 
306 <400> SEQUENCE: 7 
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307 atatagatct gatgcgacgc agaaaagc 



28 
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Sidles 1 require^ at a line not exceed 72 characters in length. This includes spaces, 

Seq#:l; Line(s) 9 
Invalid <213> Response; 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 



Seq#:4, 5,6,7 



4 t 
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L:265 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 4 

L:277 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 5 

L:289 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 6 

L:301 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:7 



